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ABSTRACT 

We present new measurements of the quasar angular autocorrelation func- 
tion from a sample of ~80,000 photometrically-classified quasars taken from the 
First Data Release of the Sloan Digital Sky Survey. We find a best-fit model of 
uj(8) = (0.066±oq24)^ _ ^°' 98±015 ' ) for the angular correlation function, consistent 
with estimates of the slope from spectroscopic quasar surveys. We show that 
only models with little or no evolution in the clustering of quasars in comoving 
coordinates since a median redshift of z ~ 1.4 can recover a scale- length con- 
sistent with local galaxies and Active Galactic Nuclei (AGNs). A model with 
little evolution of quasar clustering in comoving coordinates is best explained in 
the current cosmological paradigm by rapid evolution in quasar bias. We show 
that quasar biasing must have changed from 6q ~ 3 at a (photometric) redshift 
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of z p hot = 2.2 to 6q ~ 1.2 — 1.3 by z p hot = 0.75. Such a rapid increase with 
redshift in biasing implies that quasars at z ~ 2 cannot be the progenitors of 
modern L* objects, rather they must now reside in dense environments, such as 
clusters. Similarly, the duration of the UVX quasar phase must be short enough 
to explain why local UVX quasars reside in essentially unbiased structures. Our 
estimates of 6q are in good agreement with recent spectroscopic results (Croom 
et al. 2005), which demonstrate that the implied evolution in b Q is consistent 
with quasars inhabiting halos of similar mass at every redshift. Treating quasar 
clustering as a bivariate function of both redshift and luminosity, we find no evi- 
dence for luminosity dependence in quasar clustering, and that redshift evolution 
thus affects quasar clustering more than changes in quasars' luminosity. Our 
results are robust against a range of systematic uncertainties. We provide a new 
method for quantifying stellar contamination in photometrically-classified quasar 
catalogs via the correlation function. 

Subject headings: cosmology: observations — large-scale structure of universe - 
quasars: general — surveys 

1. Introduction 

Determining the distribution of matter (baryonic and dark), as a function of redshift, 
is a fundamental goal of cosmology, providing important information on the content of the 
Universe. Unfortunately, the majority of this matter is non-baryonic and we are forced to use 
tracers, like galaxies and quasars, to infer its presence. Study of these populations introduces 
the added complication of determining how such tracers are biased compared with underlying 
dark matter. Nevertheless, vast resources are devoted to mapping the distribution of such 
tracers to help infer the distribution of matter in the Universe. 

In the local Universe (z < 0.1), the distribution of galaxies (and thus inferred matter) 
has become increasingly constrained due to large redshift surveys such as the 2dF Galaxy 
Redshift Survey Colless et al. (2001) and Sloan Digital Sky Survey (SDSS; Gunn et al. 
1998; Lupton, Gunn & Szalay 1999; York et al. 2000; Hogg et al. 2001; Smith et al. 2002; 
Stoughton et al. 2002; Pier et al. 2003; Ivezic et al. 2004). Such surveys are now able to 
measure the density of dark matter to better than 10% and detect quite subtle features in 
the galaxy distribution, such as "baryon acoustic oscillations" (see, e.g., Tegmark et al. 2004; 
Eisenstein et al. 2005; Cole et al. 2005). In the distant Universe (z ~ 1), the distribution of 
galaxies is less constrained, as it requires extensive investments of telescope time on 8-meter 
class telescopes (e.g., DEEP2, Davis et al. 2003). At even higher redshifts (z > 1.5), our 
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knowledge of galaxies is limited by the redshifting of the bulk of their luminosity to infrared 
& submillimeter wavelengths (e.g., the GDDS survey, Abraham et al. 2004). 

When studying the distribution of matter at high redshifts (z > 1), quasi-stellar objects 
(quasars or QSOs) are a better tracer than galaxies, as they are extremely luminous and 
can be identified from current multi-color optical imaging out to a redshift of z ~ 6.5 (Fan 
et al. 2001). However, there are problems with using quasars as tracers of dark matter, for 
instance: 1) It has long been unclear how quasars are physically related to the underlying 
dark matter halos they inhabit, thus leading to uncertainty about their biasing schema. 2) 
Existing photometrically-selected quasar samples suffer from significant stellar contamination 
(~ 50%) and thus require laborious follow-up spectroscopy (see, e.g., Croom et al. 2004). 

In recent years, these two issues have begun to be addressed. In the case of quasar bias, 
it has become increasingly clear that all massive galaxies possess a central supermassive black 
hole (Richstone et al. 1998), which has a mass correlated to that of its parent dark matter 
halo (see Ferrarese & Merritt 2000; Gebhardt et al. 2000 etc.). Under the hypothesis that 
the quasar phenomenon is driven by these supermassive black holes, it is likely that quasar 
properties are linked to the evolution of underlying dark matter, as demonstrated in high- 
resolution simulations (e.g., Di Matteo et al. 2004). There have also been recent advances in 
the algorithms used to identify quasars. For example, Richards et al. (2004) have developed 
a Bayesian method to separate quasars from stars in the 4-dimensional color-space of the 
SDSS, with nominal stellar contamination as low as 5% for redshifts of 0.2 < z <2.4. 

The distribution of objects in the Universe is commonly quantified using the two-point 
correlation function (Totsuji & Kihara 1969), or its Fourier counterpart, the power spectrum 
of density fluctuations. The 2dF QSO Redshift Survey (henceforth 2QZ; Croom et al. 2004) 
has provided the most precise estimates (to date) of both the quasar power spectrum (Outram 
et al. 2003) and two-point correlation function (Croom et al. 2005). In particular, Croom 
et al. (2005) find the biasing of quasars evolves from 6q ~ 4.4 at z — 2.48 to 6q ~ 1.1 at 
z = 0.53 (see also Porciani, Magliocchetti & Norberg 2004), consistent with observations 
that local AGNs are unbiased with respect to the normal galaxy population (e.g., Wake et 
al. 2004). Together, these results constrain the mass and evolution of the dark matter halos 
that harbor QSOs, as well as the duration of the quasar phase (e.g., Grazian et al. 2004; 
Hopkins, Bahcall & Bode 2005). 

In summary, it is now possible to employ photometrically-classified quasars to investigate 
quasar bias. We introduce just such an application in this paper, in which we investigate the 
evolution of quasar clustering using the largest published sample of photometrically-classified 
quasars (Richards et al. 2004). We use photometric redshift estimates (Weinstein et al. 2004), 
inverting the angular correlation function to estimate the amplitude of quasar clustering in 
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real-space and the evolution of quasar bias with redshift. After discussing our data, and 
possible systematic uncertainties, we will measure the angular autocorrelation of our quasar 
sample. We will then present the first estimate of the clustering evolution of photometrically- 
classified quasars, also exploring luminosity-dependent clustering. Unless otherwise speci- 
fied, cosmological modeling in this paper assumes (Q m ,Q\,h = ifo/100km s _1 Mpc -1 ) = 
(0.3,0.7,0.7); consistent with WMAP results (Spergel et al. 2003), and magnitudes are cor- 
rected for Galactic extinction (using the dust maps of Schlegel, Finkbeiner & Davis 1998). 

2. Data and Methodology 

2.1. The KDE Data 

The data analyzed in this paper are from the photometrically-classified sample of Richards 
et al. (2004), which we henceforth call the KDE (Kernel Density Estimation) catalog. The 
KDE catalog is drawn from point sources with u — g < 1, (observed) g > 14.5 and (dered- 
dened) g < 21, that appear in the SDSS First Data Release (DR1; Abazajian et al. 2003). 
Separations, in 4-dimensional color-space, from the (spectroscopically-confirmed) quasar and 
stellar loci, are determined for each source. A Bayesian technique is used to classify each 
object as either "QSO" or "star". The resulting set of 100,563 quasar candidates is thus 
UVX-and-magnitude-limited in such a way as to be broadly comparable to the 2QZ. In gen- 
eral, we will refer to objects from the KDE catalog as QSOs, even though the vast majority 
of the KDE data have not been spectroscopically confirmed as quasars. The KDE catalog 
includes a photometric redshift (photoz) for each QSO (see Weinstein et al. 2004). 

2.2. The Random Catalog 

Estimating angular correlation functions requires a sample of random points that have 
the same angular selection function as the data surveyed (neglecting correlations that are 
due to QSOs tracing cosmological structure). To construct a random catalog that mimics 
the KDE data, we create a large set of random points distributed over the DR1 area. Points 
that fall in any SDSS imaging mask 1 are discarded. We assign each random point the seeing 
value of the nearest PhotoPrimary object in the SDSS database 2 and its absorption value 
from the Galactic dust maps of Schlegel, Finkbeiner & Davis (1998). Broad sky coverage of 



1 http://www.sdss.org/drl/products/image/use_masks.html 
2 http://cas. sdss.org 
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the KDE data is shown in Figure 4 of Richards et al. (2004), although the sample we use is 
also cut to the DR1 theoretical footprint, which discards 3.4% of the KDE data. The SDSS 
theoretical footprint differs from the actual sky coverage in the South, as curvature of the 
coordinate system forces drift-scanning beyond the targeted stripes. 



2.3. Correlation Function and Error Estimation 



We construct the two-point angular correlation function (u>) from counts of data-data, 
random-data and random-random pairs, via the estimator of Landy & Szalay (1993). We 
use logarithmic bins centered at each angular separation (9). The estimator is 



m = y + 1 (d 

where Q denotes a data point and R denotes a random point (see section 2.2). We use a ran- 
dom catalog 100 times larger than the data catalog, normalizing the pair counts accordingly, 
and only quote results for bins that contain at least 10 data points. 

We estimate errors using jackknife resampling (Scranton et al. 2002), which performs 
well across a range of scales (Zehavi et al. 2002; Myers et al. 2005). The jackknife method 
is to divide the data into N pixels, then create N subsamples by neglecting each pixel in 
turn. If we denote subsamples by the subscript L and recalculate ojl via Equation 1, then 
the jackknife error, o w is 



^) = E^y^)-^)] 2 ( 2 ) 

The RRl/RR term (Myers et al. 2005) weights by the different numbers of objects expected, 
due to holes, poor seeing or pixels that extend beyond the DR1 boundary. Throughout this 
paper, we jackknife-resample using 1 deg 2 pixels, sampling across thousands of realizations. 

In Figure 1, we display the autocorrelation of all objects in the KDE catalog, with 
jackknife errors. We also display the jacknife errors in ratio to: 1) Poisson errors (calculated 
via o" 2 = 2(1 + uj) 2 /QQ as only half of the QQ pairs are independent) and; 2) pixel-to- 
pixel errors (see Myers et al. 2003 for more on these errors). The error ratios in Figure 1 
illustrate that: 1) Poisson errors become systematically smaller than jackknife errors as the 
scale increases, and that; 2) the pixel-to-pixel error becomes ill-defined on the scale of a pixel 
(1 deg in Figure 1). We therefore use the jackknife error throughout this paper. 
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2.4. Modeling 

To fit models to the angular correlation function, we use a power-law (Peebles 1980) 

u(0) = AO' 5 (3) 

of amplitude A. The slope, 5, is canonically found to be 0.7—0.8 for galaxies (e.g., Connolly et 
al. 2002) although the amplitude depends on the bias of the galaxy type. Croom et al. (2005) 
suggest that 5, when averaged over 1 — 100 h~ l Mpc scales, ranges from 0.65 — 0.85 for QSOs 
(A-dominated cosmology) depending on whether redshift-space distortions are included in the 
model; they further note that a single power-law may not fairly represent QSO clustering, 
since QSO correlation estimates are capable of probing the non-linear, intermediate and 
linear regimes, which, according to perturbation theory, have different clustering amplitudes. 

3. Sources of Systematic Error 

We now address possible contaminants of our clustering analysis. Given the large areas 
we sample, we neglect the effect of the integral constraint, as it will be an order of magnitude 
or more smaller than our typical error (c.f. Scranton et al. 2002; Connolly et al. 2002). 

3.1. Star/Galaxy Classification Errors 

If galaxies that are normally resolved are imaged in poor seeing, they may be misclas- 
sified as point sources and included in the KDE catalog, impacting autocorrelation mea- 
surements. In Figure 2 we display the autocorrelation as a function of ((/-band) seeing for 
the entire KDE catalog. Seeing cuts of < 1.8 and < 1.3 arcsec remove, respectively, ~5% 
and ~70% of the KDE data. KDE objects with 2QZ spectroscopic matches suggest that 
contamination by galaxies not resolved in 1.3 arcsec seeing is extremely small (<0.2%). We 
have used the x 2 statistic to estimate the amplitude and slope of the correlation function 
for the data shown in Figure 2. Though there are very weak trends with seeing on the 
largest scales we consider, they are consistent with no dependence given the errors. As the 
hypothesis that seeing does not influence our model of QSO clustering is allowed within the 
errors, and intrinsic fluctuations in our correlation estimates are far larger, we enforce no 
seeing constraints throughout this paper. 
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3.2. 



Galactic Extinction 



Dust in our Galaxy causes QSO numbers to fluctuate but the dereddened magnitude 
limit of g < 21 imposed on the KDE catalog, coupled with the SDSS (95% detection) limit of 
g < 22.2 means that few QSOs should be obscured from the KDE sample. In the upper panel 
of Figure 3 we plot the autocorrelation of KDE objects as a function of absorption (A g ). The 
autocorrelation in A g bins resembles that of the entire catalog except for A g > 0.18, where 
Galactic extinction introduces excess clustering on large scales. Our binning in A g is chosen 
to ensure sufficient objects (~20,000) in each bin to study the clustering signal with some 
significance. We note that our A g < 0.18 cut is stricter than the A r < 0.2 cut suggested by 
Scranton et al. (2002), so it is possible that a less strict cut in A g exists that still rejects most 
of the dust-induced excess clustering. However, changing the bin resolution does not affect 
our conclusion that a cut of A g < 0.18 is sufficient to remove all of the large-scale power, and 
a less strict cut merely risks introducing a systematic without significantly improving the 
statistical precision of any clustering measurements. The lower panel of Figure 3 illustrates 
that the main cause of the dust-induced excess clustering power is objects that are both 
observed to have faint magnitude and that suffer high Galactic absorption. It is unclear 
if these objects exhibit large-scale clustering due to correlations between dust enshrouded 
regions of the Galaxy or because they are misclassified stars. Throughout our analyses, the 
KDE catalog and the random catalog are cut to regions with A g < 0.18, discarding ~20% 
of the sample. 



Based on classifying simulated QSOs, Richards et al. (2004) find that the KDE tech- 
nique is 95% efficient, although their Figure 6 suggests this efficiency is magnitude- dependent. 
Once other sources of potential clustering contamination, such as seeing and dust, have been 
eliminated, we can use clustering measurements to independently test this 95% efficiency 
claim. Further, we can test how 5% stellar contamination would impact the quasar autocor- 
relation. 

Under the substitution Q — > aQ + (1 — a)S, where a is the efficiency, or fraction of 
correctly classified quasars, it can be shown that Equation 1 becomes 



3.3. 



Stellar Contamination 



u(9) = a 2 u QQ {B) + (1 - a) 2 tuss(0) + e(9) 



(4) 



where uqq and ooss are the intrinsic autocorrelation of QSOs and stars, respectively, and e 
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is a tiny offset arising from cross-terms 

~QR + SR-QS 



e = 2(a 2 - a) 



RR 



- 1 



(5) 



As we model the uncontaminated quasar and star distributions, QR/RR, QS/RR and 
SR/RR should be ~ 1, so e ~ 0. We estimate limits on these cross-terms using: 1) (for 
quasars) all (A g < 0.18) KDE objects and; 2) DR1 stellar sources with 16.9 < g < 17.1, 
a range chosen to both match the quasar sample size and to ensure that few quasars are 
present to contaminate the stellar clustering signal. We have checked if loss is constant 
across the range 17 < g < 21, finding a slightly smaller amplitude at g ~ 21, which is 
insufficient to change any of our conclusions throughout this section, and may, in any case, 
be due to quasar contaminants. Over 1-60 arcmin scales we find |1 — (QR/ RR)\ < 0.005, 
|1 - (SR/RR)\ < 0.008 and |1 - (QS/RR)\ < 0.03. Summing these limits in quadrature 
implies |e| < 0.063(a 2 — a). As < a < 1, we conclude |e| < 0.016 as a strong upper limit. 
In this section, we assume e = but e = 0.063(a 2 — a) would not affect our conclusions. 

In the upper panel of Figure 4 we show the autocorrelation of (A g < 0.18) KDE objects 
in bins of g magnitude. There is no evidence of magnitude-dependent effects. Stellar con- 
tamination would be signified by similar clustering on all scales, since a degree probes tens 
of parsecs for typical Galactic stars detected in the SDSS, but tens of Megaparsecs at QSO 
distances. For comparison we also plot the autocorrelation for stellar sources in DR1 in the 
range 16.9 < g < 17.1. At 30 arcmin u S s ~ 0.25, dropping to ~ 0.18 at 2 degrees. We can 
use Equation 4 to quantify stellar contamination (1 — a) in the KDE sample. In the lower 
panel of Figure 4 we plot the la upper limits on 1 — a, taking ljqq from all (A g < 0.18) 
KDE objects. Though any scale is valid for measuring the upper limit on 1 — a, the effects 
of any stellar contamination should be most apparent on larger scales (where ujqq — > 0). 
The lowest upper limit measured in each bin of g is 15.5%, 5.6%, 13.0%, 6.0%, 8.2%, 12.3% 
for bins with g of 19.10, 19.76, 20.13, 20.38, 20.63, 20.88, respectively, and the lower limit is 
consistent with zero in every bin. 

As a further test, we construct a sample of spectroscopic matches to the 2QZ. Unlike 
Richards et al. (2004), we do not consider matches to the DR1 quasar catalog (Schneider 
et al. 2003; henceforth DR1QSO), as it may be biased to quote efficiencies from data that 
the KDE algorithm was trained on. We also ignore matches to SDSS DR2 (Abazajian et 
al. 2004), which are selected similarly to DR1QSO. Our matches consist of quality "1" 2QZ 
QSOs (see Croom et al. 2004) within 2 arcsec of a KDE object. For (A g < 0.18) KDE objects 
with 2QZ matches, we find efficiencies of 97.9%, 97.3%, 95.7%, 93.9%, 90.7%, 88.8%, for g 
as above, and 95.1% across the full magnitude range. This is consistent with both Richards 
et al. (2004) and our measurements of a from clustering. We note that objects targeted by 
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several independent UVX methods are more likely to be quasars (the ratio of quasars to 
non-quasars for quality "1" 2QZ objects that overlap DR1 is 57:43). 

If the efficiency of the KDE sample is as high as our analyses suggest, we can measure 
uiqq without stellar contamination having an impact. Taking uss(2 degrees)=0.18, a con- 
tamination of 5% would cause an equal contribution from quasars and stars for uqq ~ 0.0005, 
which, will be comparable with our error on uo at 2 degrees. Similarly, a contamination of 
10% would cause an equal contribution for uoqq ~ 0.0022, which is negligible, given that 
when we bin our sample by magnitude the smallest error on u>(2 degrees) is 0.0074. 

4. The Projected Clustering of QSOs 

4.1. The KDE QSO Autocorrelation 

In Figure 5 we display the autocorrelation of our (A g < 0.18) KDE sample, together 
with best-fitting models (from Equation 3). The long-dashed line, a fit across "all" scales, is 
marginally rejected, with P(< x 2 ) — 0.27, which is expected as a single power-law is not a 
good fit to either spectroscopic quasar samples or CDM models (see, e.g., Croom et al. 2005). 
To the eye, Figure 5 suggests breaks at ~1 arcmin and ~25 arcmin. A small-scale break 
is worth investigating, as Zehavi et al. (2004) find power-law departures on 1 — 2 h" 1 Mpc 
scales for the projected galaxy autocorrelation. However, based on Poisson statistics, which 
are valid on small scales, (see Figure 1), there is no evidence for a break at 1 arcmin, as the 
data suggest uo{< 1 arcmin) = 0.131 ±0.036 and integrating the fitted model yields 0.103. If 
we model with a break at 1 arcmin, the fits are only slightly improved, with P(< x 2 ) = 0-80 
for 9 < 1 arcmin (the dotted line in Figure 5) and P(< x 2 ) = 0-36 for 9 > 1 arcmin (similar 
to the solid line in Figure 5). There is, however, marginal evidence for a break at 25 arcmin, 
as lu(< 25 arcmin) = 0.0086 ± 0.0013, compared to 0.0055 for the model. Corrected for 
the ratio between Poisson and jackknife errors (see Figure 1), this is a 1.7 'a fluctuation. 
Further, a power-law fit over 1-25 arcmin (the short-dashed line in Figure 5) provides an 
excellent fit of P(< X 2 ) = 0.98. We intend to repeat our analyses with a larger sample of 
photometrically-classified quasars, drawn from SDSS DR4, and will soon know whether this 
break persists in a larger sample. 

The regime that we will consider is 9 > 2 arcmin (> 0.75 h^ 1 Mpc at the sample's me- 
dian redshift, z ~ 1.4, as calculated from KDE objects with spectra in DR1QSO). We study 
these scales as, on average, they are dominated by points at 9 > 30 arcmin (> 10 h^ 1 Mpc; 
which should be in the linear regime of clustering) but can still provide meaningful con- 
straints from the KDE sample. Fits to the data are statistically unchanged by fitting from 
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2 arcmin out to any maximum scale in (at least) the range 40-250 arcmin (~14-89 hr x Mpc), 
as we will indirectly demonstrate in section 4.3. Over scales of 2 < 9 < 250 arcmin (~0.75- 
89 h~ x Mpc), our best-fitting power-law model has a slope of 5 = 0.98 ±0.15 consistent with 
Croom et al. (2005), who find (accounting for distortions that affect clustering along the 
redshift coordinate), a nearly acceptable power-law fit with slope 7 = 5+ 1 = 1.866 ± 0.060 
over 1 — 100 Yr 1 Mpc. Finally, we note that the largest-scale points we plot in Figure 5 are 
statistically consistent with being anti-correlated, in agreement with CDM models, which go 
negative around 70 — 100 hr 1 Mpc (e.g., Croom et al. 2005). Further, the bin plotted at 
~200 arcmin spans the range 56 — 140 hr 1 Mpc and is tantalizingly higher than adjacent 
bins. This scale is consistent with that expected for a baryon peak (see, e.g., Eisenstein et 
al. 2005) but the higher amplitude of the bin at ~200 arcmin is certainly not statistically 
significant in the current sample. 

4.2. Limber's Equation and the Evolution of the Quasar Correlation Function 

When the angular correlation is expressed as in Equation 3, the de-projected spatial 
correlation function can be written (Peebles 1980) 



where 7 is the power-law slope, r is the local spatial scale-length, and e parameterizes 
clustering evolution. In general, for e < 0, clustering diminishes with cosmic time, meaning 
objects with high redshift were more clustered. 

The spatial correlation function can be integrated to yield its angular projection (Limber 
1953). In the small angle approximation (0 C 1 radian), unknowns in Equations 3 and 6 
can be related (see Peebles 1980 for a full derivation) 





5 = 



7-1 



(7) 
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where H 1 = r(0.5)T (0.5[7 — 1]) /T(0.57), T is the gamma function, x is the radial comoving 
distance, dN/dz is the redshift selection function, and E z = H z /c = dz/dx- Strictly, x should 
be the angular, or transverse, comoving distance, however, in our chosen, flat cosmology, 
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radial and transverse comoving distances are equivalent. The Hubble Parameter can be 
found via 



H 2 z = HZ[n m (i + zf + n A ] 



(9) 



Perturbation theory suggests the mass correlation function is scale-dependent, transi- 
tioning from linear to non-linear scales at 10 — 20 h^ 1 Mpc at z ~ (e.g., Hamilton & 
Tegmark 2002), and at smaller scales at higher redshift. In the highly non-linear regime 
(<C 10 h^ 1 Mpc), the correlation function should evolve via stable clustering (e.g., Peacock 
& Dodds 1996). In the linear regime, clustering evolution can be parameterized by substi- 
tuting D z 2+7 ^ for (1 + z)~^ + ^ in Equation 6 — D z is often called the linear growth factor. For 
an Q m = fltotai = 1 cosmology, D z — (1 + z)~ l . For flat, A cosmologies, D z , is suppressed as 



9z 



1 



Dz go(i + z) 

where go normalizes to the fiducial case, and g may be approximated as 
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(Carroll, Press & Turner 1992). The cosmological parameters (in a flat cosmology) evolve as 
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(12) 



Using Equation 11 is 30-40 times faster than integrating the cosmological parameters and 
produces a g z /go ratio consistent with Equation 28 of Carroll, Press & Turner (1992) to 
0.2% or better for all redshifts (in our chosen cosmology). In the linear formalism, the 
de-projection can be derived by substituting D 2 Z for (1 + Equation 8. 

Given an estimate of diV / dz, we can determine ro, the local scale-length of the correlation 
function. KDE quasars are trained on DR1QSO colors, which are flux-limited in i, brighter 
than the g < 21 KDE limit. However, the KDE redshift selection to g < 21 should resemble 
that of UVX-selected quasars as KDE objects are weighted against the stellar locus to 
g < 21 and then undergo a UVX cut. Ideally, we would obtain spectra for a small, random 
sample of KDE objects to establish selection but in the absence of this information our best 
sample consists of all known UVX quasars in the field. We therefore determine dN/ dz from 
all (A g < 0.18) KDE objects with spectroscopic matches (in DR1QSO, DR2 or the 2QZ). 
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We have recreated our analysis using only the redshifts of DR1QSO matches and find our 
results are affected ~ 1%. Using only the redshifts of matches to DR2 quasars or to the 
2QZ affects our results ~ 3%. Such small changes are well within our random error. In 
Fig 6 we plot the spectroscopic redshift distribution we use, in comparison to that obtained 
assuming the photometric redshifts of the KDE sample are exact. The two histograms are 
broadly consistent, which is entirely to be expected as, in an ensemble sense, the colors of 
the spectroscopic and photometric quasar samples are broadly consistent. 

In Figure 7, we display the results of de-projecting the solid line in Figure 5 to obtain tq. 
In the upper panel of Figure 7, we plot tq(z), for a range of models in comparison to scale- 
lengths obtained for spectroscopically-confirmed quasars by Croom et al. (2005). In the lower 
panel of Figure 7, we compare to CNOC2 galaxies (Carlberg et al. 2000), as locally there is 
evidence that galaxies cluster like AGN (Wake et al. 2004). CNOC2 galaxies are luminous, 
so might be more clustered than average galaxies. Nevertheless, clustering of KDE objects 
and CNOC2 galaxies are most consistent for evolutionary models that predict little or no 
evolution in clustering. Although our clustering measurements, being normalized at z ~ 1.4, 
provide the strongest constraint when combined with local measurements, the Croom et 
al. (2005) data in the upper panel of Figure 7 further demonstrate that linear theory is 
increasingly inconsistent with quasar clustering at higher redshift. Linear theory predicts 
that dark matter, having had less time to collect under gravity, would be more clustered 
locally than at earlier times. We find, in contrast, that QSOs are better represented by a 
model where their clustering is nearly constant with redshift from z ~ 1.4 to z ~ 0. If 
linear theory is to be correct, this confirms a picture where QSOs were more clustered than 
underlying matter at high redshift, as has been argued for ACDM cosmogonies (Efstathiou 
& Rees 1988). Further, under the assumption of linear theory, the UVX quasar phase must 
be short, as is obvious from the dot-dash lines in the upper panel of Figure 7 — if we could 
see the same quasars locally as at z ~ 1.4 they would now be significantly more clustered 
than galaxies, having a scale-length of 12.8 ± 2.4 h^ 1 Mpc. 

This analysis was carried out using simple assumptions; Limber's Equation, the angu- 
lar distribution of KDE objects and the redshift distribution of spectroscopically-confirmed 
QSOs; however, it might be criticized for several reasons, most notably, QSO clustering, as 
we measure it, is probing a huge volume and being averaged across many different scales. 
Jenkins et al. (1998) have found that dark matter in ACDM simulations displays a local 
scale length of r (z = 0) ~ 5 Yr 1 Mpc, and has an approximate power-law form over scales 
of 1 — 20 hr l Mpc. If we restrict our analysis to these scales, then evolution of QSO clus- 
tering according to linear theory is still rejected at the 99.9% level, for a local scale length 
of ro = 5 hT 1 Mpc. Regarding this simple analysis as strong evidence that QSOs at high 
redshift were more clustered than the dark matter they traced, we will proceed by using 
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photometric redshift information to quantify this evolution in terms of bias. 



4.3. Angular QSO clustering in Photometric Redshift Bins 



Kaiser (1984) first discussed biasing schemes, when conjecturing that rich clusters form 
where the clustering amplitude of dark matter exceeds some threshold. Bardeen et al. (1986) 
extended the concept of bias to galaxies, or any object that formed in the rare peaks of a 
Gaussian random field. Though bias might be a complex function of formation processes, it 
is often represented by a simple linear factor, b, that should acceptably parameterize QSO 
clustering relative to underlying dark matter (i.e., £qq = 5 2 £). 

In the previous section, we modeled QSO clustering evolution via Equation 6, or it's 
linear theory equivalent, by assuming £qq = £. An alternative representation is 



allowing underlying dark matter to evolve according to linear theory even if r (z) does not, 
as QSO clustering traces a bias parameter that may evolve with redshift. 

Equation 13 can be de-projected as for Equation 6, but the results are now interpreted 
differently. If we assume ro(z — 0) =5 h^ 1 Mpc for the matter correlation function, as is 
appropriate for local dark matter in ACDM simulations (Jenkins et al. 1998), then 



where the z superscript indicates that these local values are implied by de-projecting from 
a particular redshift. Any scale-dependence in our measure of b and r can be ignored 
provided we average over scales large enough for linear theory to hold. Applying this model 
to the linear theory scale length deduced in section 4.2, we find b z q 1A = 2.51 ± 0.46 for 
u){6 > 2 arcmin). Of course this result is averaged over many different redshifts. 

To better quantify the evolution of QSO bias, we redo the analysis of section 4.2 in 
photometric redshift shells (see, e.g., Brunner, Szalay & Connolly 2000). Although diluting 
the statistics of the KDE data, this approach is attractive as narrowing the redshift range 
of any analysis allows more consistent scales to be de-projected. Though the dispersion in 
the photozs, as compared to objects with a spectroscopic redshift, is typically A ~ 0.1 —0.2, 
causing scatter between bins, this dispersion should merely introduce noise, provided the 
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redshift shells are large enough, i.e., larger than the typical Az phot . Given that bin sizes are 
restricted by Az p hot, there is motivation to improve QSO photoz estimation. 

An aspect of the photometric redshifts that might dilute clustering, by scattering quasars 
into entirely the wrong bin, is that some photoz estimates have likely solutions at several 
discrepant redshifts (these are sometimes called catastrophic estimates). Weinstein et al. 
(2004) quantify the probability of their photozs, so we could reduce our sample to those 
KDE objects that certainly do not have catastrophic redshift estimates. However, after the 
previous cuts we made to remove stars, only 7% of our sample have worse than a 50% chance 
of being in their estimated photometric redshift interval, and most of these objects have 
several secondary solutions, rather than a single clear alternative. We will thus discard no 
objects from our sample on the basis of photozs. To test this, we have repeated our analyses 
with a randomly chosen 7% of our sample assigned to a different photometric redshift bin, 
and find that results fluctuate negligibly on all scales, and well within the errors. The likely 
reason why catastrophic estimates do not adversely impact our analysis is our large bin sizes 
(Azphot), and there is thus motivation to reduce the number of catastrophic estimates while 
improving QSO photoz estimation. 

We split our (A g < 0.18) KDE sample into 4 photometric redshift bins, containing 
~ 16, 500 objects each, and calculate the angular autocorrelation in each bin (solid circles in 
Figure 8). We limit our main analysis to the range 0.4 < z p hot < 2.1, for several reasons. The 
data are UVX-selected, meaning the range 0.4 < z < 2.3 is approximately most sensitive 
to the QSO SED. Weinstein et al. (2004) suggest that their photoz estimation is best at 
0.8 < Zphot < 2.2 but we include lower redshift data in a bin from 0.4 < z p hot < 1-0 to increase 
the number of objects for our analysis (and to help compare quasar clustering to galaxy 
clustering). Dispersion in redshifts for 0.4 < z phot < 0.8 is larger than at other redshifts 
but not by enough to preferentially scatter objects to z phot > 1.0. To avoid probing too 
many scales in a single redshift bin we use a scheme that conveniently samples both similar 
scales and similar numbers of objects. As there remain significant numbers of A g < 0.18 
KDE objects (10882) at z p hot > 2.1, we measure their autocorrelation to provide additional 
constraints at high redshift (open circles in Figure 8). 

We consider fits to our data in Figure 8 out to 14, 22, 35, 56 and 89 h" 1 Mpc. In 
general, we do not fit on scales < 0.75 h~ l Mpc (at the median bin redshift), which should 
be in the fully non-linear regime. Given that Croom et al. (2005) find no evolution in the 
autocorrelation slope, we fit a single 7 for all redshifts. We use the slope at 9 > 2 arcmin 
displayed as the solid line in Figure 5. We also allow 7 to float as a free parameter, to 
demonstrate that there is sufficient degeneracy between measurements of slope and amplitude 
that fixing the slope does not unduly fix our results. When calculating 6q, we assume our 
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autocorrelations trace the underlying matter correlation slope (but not the amplitude) and 
that the matter correlation has r (z = 0) = 5 h^ 1 Mpc. Both assumptions are reasonable 
as the bias necessary to make ACDM match the galaxy autocorrelation is nearly linear (and 
certainly < 1.1 on scales < 20 h^ 1 Mpc; Jenkins et al. 1998), and r ~ 5 h^ 1 Mpc for galaxies 
(e.g., Baugh 1996). 

When de-projecting the correlation function we correct for imprecise photozs via the 
method of Brunner, Szalay & Connolly (2000). After splitting our sample into photometric 
redshift bins, we widen the derived dN/dz p h t by two one-tailed Gaussians 

= [ Zu Z2 ] e -{{*-[^M)/°} 2 . [o < z < Zu z > Z2 ] (15) 

(J. z 

affixed at either end of a bin of z p hoti < z phot < z phot2- Here, a is the dispersion between pho- 
tometric and spectroscopic redshifts, which we derive from KDE objects with spectroscopic 
matches (in DR1QSO, DR2 or the 2QZ), after a 2a clip to remove catastrophic photozs. As 
a check on this method, we repeat our analysis assuming dN/ dz in each bin of z p hot is given 
simply by all spectroscopic matches to KDE quasars (see Table 1 and Figure 9). 

In Table 1, we catalog the best-fitting models in photometric redshift bins. Also shown 
are the de-projected scale lengths for the real-space correlation function (assuming the matter 
autocorrelation evolves according to linear theory), and the derived quasar bias (assuming 
the matter correlation has a local scale-length of r (z = 0) =5 hr l Mpc). When the slope is 
fixed, and an appropriate dN j dz is used, a model where bq is both constant with redshift and 
linear (i.e. 6q = 1), is ruled out at high significance (> 99.9%, consistent with section 4.2). 
Note that any small error in the 5 h^ 1 Mpc scale-length assumed for local dark matter would 
not affect our conclusions, as consistently changing r introduces a systematic offset in 6q 
rather than noise. 

In Figure 9, we compare values of 6q from Table 1 to data from Croom et al. (2005). 
We find consistent results irrespective of the scale we fit (upper left panel of Figure 9). As 
the slope and amplitude of a power-law fit are degenerate, fixing 7 is not inconsistent with 
allowing 7 to vary as a free parameter. Allowing 7 to vary merely increases the error in our 
measurements of 6q and our results for 7 = 1.98 tend to the lower end of this increased error 
range, particularly at high redshift (top-right panel). Our data initially seem systematically 
lower than Croom et al. (2005); however, when the redshift distributions are widened by 
Gaussians, to reflect their photometric nature, 6q increases (lower-left panel). This technique 
of widening by Gaussians is consistent with determining dN/ dz from spectroscopic matches 
in each bin of photometric redshift (lower-right panel) and in either case our results, derived 
using photometrically-classified QSOs, are consistent with the results of Croom et al. (2005), 
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which were derived by directly measuring the real-space clustering of (a smaller sample of) 
spectroscopically-confirmed QSOs. Power-law fits to the data are reasonably acceptable at 
every redshift. For the bins with z p /j Ot =[0.75, 1.20, 1.53, 1.82, 2.22], respectively, P(< x 2 ) = 
[0.34, 0.76, 0.65, 0.83, 0.96] for the fits out to 89 hr l Mpc. Allowing 7 to vary as an extra 
parameter does not improve these fits, yielding P(< x 2 ) — [0.34, 0.72, 0.62, 0.86, 0.94] 

The mean quasar luminosity increases with redshift for a flux-limited sample. However, 
as discussed by Croom et al. (2005), UVX quasars track characteristic quasar luminosities up 
to z ~ 2 — 2.5. For example, for the redshift shells used; z p hot — [0.75, 1.20, 1.53, 1.82, 2.22]; 
the mean absolute magnitude (calculated as in section 4.4) is; M g = [—22.57, —23.63, —24.17, 
—24.49, —25.02]. At these redshifts, a "characteristic luminosity" for quasars (e.g., Croom et 
al. 2004; Richards et al. 2005) can be estimated as M* = [-23.81, -24.74, -25.23, -25.53, 
—25.75]. Thus the quasars we have considered are consistently around a magnitude fainter 
than M*. Ideally, in a large enough sample, we would separate the effects of luminosity 
from those of evolution by studying the quasar autocorrelation as a bivariate function of 
both luminosity and redshift. In the next section, we attempt this, testing if it is feasible to 
measure the evolution of QSO clustering as a function of intrinsic luminosity. 

4.4. Quasar Clustering as a Function of Luminosity and Photometric Redshift 

To determine absolute magnitudes (M g ) for KDE objects, we assume each photoz is, on 
average, a good estimate of redshift and use it to calculate M g from g-band magnitudes. We 
use a K-correction of if (0.4 < z < 2.2) = —0.42 — 2.5(1 + a) log(l + z), with (spectral index) 
a = —0.45 (i.e., f v oc v a ) from Wisotzki (2000), whom adopt a break at z ~ 0.4 to ensure K 
is zero locally, and suggest that this better approximates K(z) at high redshift. Ideally, we 
would study quasar bias in equal bins of M g , however, the large volume probed by quasars 
means that M g spans ~8 magnitudes, and high and low redshift bins do not overlap. It is 
thus difficult to fairly compare redshift bins without considering samples so small that noise 
dominates. Instead, we split the KDE sample into three photoz bins that contain equal 
numbers, then subdivide these bins into three in M g that contain equal numbers. We then 
measure the autocorrelation of each of these nine subsamples. 

Figure 10 shows 6q as a bivariate function of M g and redshift, derived as in section 4.3, 
assuming 7 = 1.98 and using Equation 15 for dN/dz. The implied values of bq are noisy 
but are marginally consistent at every redshift irrespective of absolute magnitude. We there- 
fore certainly cannot rule out the hypothesis that QSO clustering is independent of QSO 
luminosity. Also, It appears that evolution of the quasar population with redshift has a 
stronger affect on QSO clustering than changes in quasars' luminosity; and, as would be ex- 
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pected from section 4.3, we again find that QSOs are increasingly more biased with redshift, 
although this is rendered marginal by the lower numbers of objects in each bin. 

5. Discussion 

Implications of increasing QSO bias with redshift, and, more specifically, of the data 
plotted in the lower-right panel of Figure 9, have been discussed by Croom et al. (2005), 
who use the ellipsoidal collapse model of Sheth, Mo & Tormen (2001) to estimate the mass 
of the dark matter halo (DMH) in which quasars of given bias reside. Croom et al. (2005) 
find that UVX QSOs reside in halos of similar mass at every redshift (see also; Porciani, 
Magliocchetti & Norberg 2004; Grazian et al. 2004), and quote this mass as M^mh — 
(3.0 ± 1.6) x 10 12 /i _1 M Q . This derived Mdmh is locally consistent with the mass of an 
unbiased halo, M* (i.e. QSOs are locally biased similarly to L* galaxies). A constant Mdmh 
for quasars can be understood if M* and quasar bias evolve in step, so that QSOs are more 
biased at high redshift when M* is less massive. 

Interestingly, the fact that Mdmh for UVX QSOs does not evolve but that simulated 
dark matter halos merge and grow, suggests that objects observed in a QSO phase at high 
redshift must inhabit more massive dark matter halos by the present, and have turned off 
(or, more accurately, are no longer observed in UVX surveys, so are no longer in a UVX 
QSO phase). Thus the QSO phase cannot be long-lived. We are left with a picture where 
objects pass through a QSO phase when they inhabit dark matter halos of a certain mass. 
A likely scenario is that the UVX quasar phase is triggered by a merger between halos of a 
characteristic mass (or two galaxies embedded in a single halo of that characteristic mass), 
and then is limited by some process to a timescale shorter than that typical of additional 
mergers, which would create a more massive parent halo for the quasar. Meanwhile, smaller 
halos merge to eventually form more massive objects that harbor the correct conditions to 
ignite a QSO. As the UVX quasar phase is a limited one, we predominantly see UVX quasars 
at a time "close" to the merger that triggered them, and certainly, on average, before further 
mergers take place — hence we see UVX quasars in a single, average halo mass. Further, the 
black holes that fueled quasars visible at redshifts of 2 and above should now reside in massive 
halos, they should also, however, have no UVX accretion signature. 

Any short-lifetime model of the QSO phase must include a mechanism that can damp 
(UVX) quasar accretion processes on a timescale shorter than the typical merger rate. An 
obvious explanation is a natural limit to quasar fuel reserves, or a central engine that grows 
to a point where it produces sufficient radiation pressure to expel its fuel source (e.g., Silk & 
Rees 1998; Fabian 1999; Sazanov et al. 2005; Di Matteo, Springel & Hernquist 2005). There is 
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evidence (Hutchings & Neff 1992; Mihos & Hernquist 1996; Bahcall et al. 1997) that quasars 
result from a merger of two gas-rich galaxies of near equal mass. Recent physical models 
(Hopkins et al. 2005a,b) demonstrate how such galaxy mergers could lead to quasars with 
a brief peak in optical luminosity, and a short-lived UVX stage. If the mergers that cause 
UVX quasar activity occur between particularly massive, particularly gas-rich galaxies, then 
further mergers, once fuel is depleted in the Universe, would not necessarily reignite the UVX 
quasar phase (e.g., Hopkins et al. 2005c). This could explain why, on average, additional 
mergers fail to produce populations of UVX quasars residing in more massive halos at low 
redshift. However, models of galaxy formation must still explain why those mergers that 
initially trigger UVX quasars always occur within a halo (or two merging halos) with a total 
characteristic mass ~ 3 x 10 12 h~ 1 M Q (averaged over the population at a given redshift), and 
why mergers that occur in more (or less) massive halos do not produce quasars typically 
observed in UVX surveys. In the self-consistent galaxy-merger approach of Hopkins et al. 
(2005c) this characteristic mass is self-evident as the mass of those AGN that have a peak 
optical luminosity above the observable threshold. However, the luminosity distribution of 
quasars is empirically set in this model, so the equivalent question is why galaxy mergers in 
the Universe have led to quasars with the observed distribution of peak luminosities. 

We find that the luminosity of the QSO population at a given epoch bears no significant 
relationship to QSO bias, in agreement with results from spectroscopic surveys (see, e.g., 
Croom et al. 2005). As luminosity-independent QSO bias would suggest the mass of a 
quasar's parent halo is independent of the quasar's luminosity, the implications of such a 
relationship merit speculation. Given the M BH — a correlation (Ferrarese & Merritt 2000; 
Gebhardt et al. 2000) it is unlikely that the masses of the black holes (Mbh) that drive 
QSOs bear no correlation with the masses of the halos they reside in — but there could be 
a range of accretion efficiencies across the QSO population, with more luminous QSOs at 
every redshift having more efficient accretion process. Again, this schema naturally arises 
within the formalism of Hopkins et al. (2005c), and Lidz et al. (2005) have predicted the 
implications of that model, finding them to be consistent with the empirical fit from Croom 
et al. (2005) that we plotted in each panel of Figure 9. The models applied by Lidz et al. 
(2005) formally predict little or no luminosity dependence to quasar clustering, again broadly 
consistent with the work in this paper. However, we note that our data are currently sparse 
and noisy, and it remains to be seen definitively, from larger QSO samples, whether there is 
any luminosity dependence to the clustering of QSOs. We are engaged in producing a larger 
sample of photometrically-classified QSOs that we hope will formally start to constrain any 
quasar clustering as a function of luminosity. 
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6. Conclusions and Future Work 

We have used a large sample of photometrically-classified QSOs from DR1 to estimate 
the quasar autocorrelation as a function of luminosity and photometric redshift. We have 
demonstrated similar results using our "proof-of-concept" , photometrically-classified sam- 
ple as were obtained with the largest statistically-defined, spectroscopically-confirmed QSO 
sample available contemporaneously with DR1. Using photometric redshifts, we have con- 
firmed that quasar clustering shows little evolution, suggesting QSOs are increasingly more 
biased with redshift up to z ~ 2.2. We have attempted to measure the bivariate clustering 
amplitude of QSOs, finding that in a given redshift range there is no measurable dependence 
of QSO bias on QSO luminosity, and that evolution of the quasar population with redshift 
seems to have a stronger effect on QSO bias than changes in quasars' luminosity — however 
the errors on this measurement are still large enough to allow some overlap between redshift 
evolution and luminosity evolution, and a larger data sample (or improved techniques) will 
be necessary to make definitive conclusions. We speculate that QSO luminosity evolution is 
likely independent of mass, depending mainly on accretion efficiency, and have discussed the 
implications of this in light of the recent models of Hopkins et al. (2005a,b,c). 

We have confirmed that, for A g < 0.18, the KDE catalog of Richards et al. (2004) is 
contaminated by stars at only the 5% level, easily low enough that quasar clustering can be 
meaningfully studied. We have only used a fraction (~20%) of the eventual SDSS data, so the 
KDE technique should prove an impressive resource for quasar science. We note that there is 
room for improvement in the efficiency of the KDE algorithm at fainter magnitudes. There 
are firm scientific reasons for improving the classification of faint QSOs, not least of which is 
testing the evolution of QSO clustering across a significant range of QSO luminosity. Larger 
photometric samples and improved faint-end classification will increase the overall number 
of photometrically-classified QSOs, which will both improve the significance of multivariate 
estimation and facilitate quasar autocorrelation estimation with improved angular binning 
resolution. Larger samples and better classification will not, alone, be sufficient to increase 
binning resolution in redshift space beyond that used in this paper, as the typical photoz 
dispersion is currently comparable to the bin size. It is thus useful and necessary to also 
improve QSO photometric redshift estimation. We are currently engaged in producing a 
larger catalog of photometrically-classified quasars using improved techniques with novel 
priors and believe that QSO clustering measurements will soon be repeated with ~500,000 
quasars that are even more efficiently classified. 
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Fig. 1. — The autocorrelation of all data from the KDE catalog with error comparison. The 
points plot the autocorrelation estimated using Equation 1, which we have independently 
verified are in excellent agreement with what is measured using the approach of Scranton et 
al. (2002). We plot jackknife errors but have also computed Poisson and pixel-to-pixel errors. 
The lines plot the ratio of Poisson to jackknife errors (dotted), and the ratio of pixel-to-pixel 
to jackknife errors (dashed). 
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Fig. 2. — The effect of seeing on the KDE QSO autocorrelation. The autocorrelation of all 
KDE objects is plotted for a range of seeing cuts. Seeing in this plot is measured in the 
g band. Note that the most liberal cut of 1.8 arcsec or better is effectively the same as 
making no seeing cut. All errors are jackknifed. Points have been offset within each bin to 
aid clarity. 
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Fig. 3. — The effect of absorption by Galactic dust on the KDE autocorrelation. The upper 
panel shows the KDE autocorrelation binned by galactic absorption in g, with ~20,000 
objects in each bin. Clearly absorption of A g > 0.18 introduces spurious power on large 
scales. The lower panel repeats the analysis, in the same bins of A g , for objects that are 
both faint and highly reddened, by considering objects observed fainter than 21 in any SDSS 
band (i.e. with no correction to magnitude for Galactic dust). There are ~5,000 objects per 
bin. Faint, highly-obscured objects introduce large-scale clustering and are the main culprit 
in causing the effect for A g > 0.18. Objects observed with magnitudes > 21 that are not 
heavily obscured by Galactic dust display no such effect. All errors are jackknifed. Points 
have been offset within each bin to aid clarity. 
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Fig. 4. — To search for magnitude-dependent stellar contamination in the KDE catalog we 
consider, in the upper panel, the autocorrelation of (A g < 0.18) KDE QSOs as a function of 
g magnitude. The KDE data have been divided into 6 bins of approximately equal numbers, 
containing ~12,600, ~14,100, ~9,700, ~11,400, ~13,300 and ~15,000 QSOs from brightest 
to faintest, respectively. In the lower panel, the plotted points represent la upper limits on 
the stellar contamination (1 — a in Equation 4). These limits are derived using u>ss estimated 
from star-like objects in DR1 that have magnitude in the range 16.9 < g < 17.1 (plotted as 
crosses in the upper panel), and taking ujqq from all (A g < 0.18) KDE objects. Points have 
been offset within each bin to aid clarity. All errors are jackknifed. 
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Fig. 5. — The autocorrelation of A g < 0.18 KDE objects. Points are plotted with jackknife 
errors, and binned logarithmically. The best-fitting power-law model across "all" scales 
(0.04 < 9 < 250 arcmin) is displayed as a long-dashed line. There is marginal evidence for a 
break in the power-law at ~25 arcmin. The lines display the best-fitting power-law model for 
the scales over which they are plotted; 0.04 < 9 < 1 arcmin (dotted line), 1 < 9 < 25 arcmin 
(short-dashed line) and 2 < 9 < 250 arcmin (solid line). A scale of 10 arcmin is ~3.5 h^ 1 Mpc 
at the median redshift of the (A g < 0.18) KDE sample. 
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Fig. 6. — The (normalized) redshift distributions for KDE objects (with A g < 0.18). The 
solid line combines the redshifts of spectroscopic matches to the KDE catalog from the 2QZ, 
DR1QSO and DR2. Any objects that appear in multiple catalogs are assigned a redshift 
in the order DR1QSO-2QZ-DR2. The dot-dash line is the histogram returned assuming 
that the photometric redshifts in the KDE catalog are exact (binning is coarser to reflect 
imprecision in these estimates). 
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Fig. 7. — Evolution of the real-space correlation scale length. The models discussed in sec- 
tion 4.2 are normalized to the best-fitting amplitude and slope for the KDE autocorrelation 
(the solid line in Figure 5). These models are compared to spectroscopic measurements of 
the quasar autocorrelation from Croom et al. (2005) in the upper panel and the autocorre- 
lation of bright galaxies from Carlberg et al. (2000) in the lower panel. In both cases the 
cosmology is chosen to match the spectroscopic data. The linear theory models are plotted 
with la error bars. Note that the tq{z) scale is logarithmic. 
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Fig. 8. — The KDE autocorrelation in photometric redshift bins. In each bin, the solid 
circles are our standard A g < 0.18 KDE sample (~16,600 quasars per bin), the solid line is 
the best-fitting power-law with 7 = 1.98 (as determined in Figure 5) and the dashed line 
is the best-fitting power-law with 7 allowed to float as a free parameter. In the lower-right 
panel, we also plot a high redshift bin, which is equivalent to everything with z P hot > 2.1 
from the KDE catalog — the open circles that represent these data are offset slightly. The 
model fits to the open circles are not plotted, as they are very similar to those for the 
1.7 < z p hot < 2.1 bin (see Table 1). All errors in this plot are jackknifed. 
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Fig. 9. — Derived QSO bias evolution as a function of photometric redshift. The upper-left 
panel shows that estimates are consistent irrespective of the scale over which the correlation 
function is fit. The upper-right panel demonstrates that the results are consistent whether 
the slope is fixed at 7 = 1.98 (the best-fit slope at r > 0.75 Yr x Mpc for the full sample) or 
allowed to be a free parameter. The lower left panel mimics the upper-right but demonstrates 
the effect of widening the redshift distributions by Gaussians (see Equation 15). The lower- 
right panel compares the effect of widening the distribution by Gaussians to an alternate 
approach, determining dN/dz p hot from spectroscopic matches in each z p hot bin. The crosses 
are data from Croom et al. (2005) and the solid lines plot &q(z) = 0.53 + 0.289(1 + z) 2 (the 
empirical fit they derived from these data) with la error ranges (the dashed lines). 
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Fig. 10. — Bivariate KDE QSO autocorrelation by redshift (z p hot) and absolute magnitude 
(M g ). Our standard A g < 0.18 KDE sample is split into 3 redshift bins containing equal 
numbers and then further split into three M g bins of equal numbers (resulting in ~8100 
quasars per bin). The labels show the mean g apparent magnitude, mean M g , implied QSO 
bias (6q) and x 2 likelihood for each autocorrelation fit, while the dotted lines show the best- 
fit power law. Low \ 2 likelihoods could be improved by rebinning outliers without changing 
the fit. Errors in this plot are jackknifed. 



Table 1. Estimates of the QSO bias, b Q , from the amplitude of the de-projected 
correlation function in photometric redshift bins (with median redshift z p hot)- 



Method/Scale at z phot bin z phot = 0.75 z phot = 1.20 z phot = 1.53 z phot = 1.82 z phot = 2.22 



Observed Amplitude, A (arcmin 1 7 ) a 



U./O \ r <. 14 Al ivipe 


n n/io 4- n n^Q 
U.U4y ± U.Uoo 


U.zoo it U.UOO 


U. 1 i ( ± U.Uoo 


n i on _i_ n n^/i 
U.loU it U.U04 


U.140 it U.UoZ 


U./O \ / \ ZZ fl ivipe 


U.UDU HI U.UOl 


u.zoo it u.uoo 


U.loO It u.uoo 


U.1UZ It U.UOO 


U.14y It U.UoU 


fl Vc; <-" y, o c f, — 1 A/Ti-i/. - 1 
U. ID S r s. 00 fl IVlpC 


U.UOz ± u.uou 


n oon -U n n^9 
U.zZU it U.UOz 


n ini _i_ n n^9 
u.iyi it u.uoz 


n iri _i_ n n^9 
U.1U1 it U.UOz 


u.ioi it u.u<y 


0.75 < r < 56 ft" 1 Mpc" 1 


0.058 ± 0.049 


0.218 ± 0.051 


0.190 ± 0.052 


0.157 ± 0.051 


0.152 ± 0.079 


0.75 < r < 89 h' 1 Mpc -1 


0.059 ± 0.049 


0.218 ± 0.051 


0.190 ± 0.052 


0.158 ± 0.051 


0.154 ± 0.079 


7 not fixcd b ° 


0.020 ± 0.015 


0.30 ± 0.17 


0.13 ± 0.09 


0.28 ± 0.18 


0.39 ± 0.38 


De-projected ro 












0.75 < r < 14 A- 1 Mpc- 1 


3.8 ± 2.1 


9.5 ± 1.1 


7.5 ± 1.2 


7.5 ± 1.2 


9.3 ± 2.7 


0.75 < r < 22 ft" 1 Mpc" 1 


4.2 ± 1.9 


9.4 ± 1.1 


7.6 ± 1.1 


7.2 ± 1.2 


9.4 ± 2.6 


0.75 < r < 35 h,- 1 Mpc" 1 


3.9 ± 1.9 


9.1 ± 1.1 


7.7 ± 1.1 


7.1 ± 1.2 


9.5 ± 2.5 


0.75 < r < 56 h' 1 Mpc" 1 


4.1 ± 1.8 


9.1 ± 1.1 


7.7 ± 1.1 


7.1 ± 1.2 


9.6 ± 2.5 


0.75 < r < 89 ft" 1 Mpc" 1 


4.2 ± 1.7 


9.1 ± 1.1 


7.7 ± 1.1 


7.1 ± 1.2 


9.6 ± 2.5 


dN/dz from spectra b c 


6.7 ± 2.8 


11.1 ± 1.3 


13.0 ± 1.8 


12.2 ± 2.0 


15.8 ± 4.1 


dTV/dz widened by Gaussian* 3 d 


6.0 ± 2.5 


11.7 ± 1.4 


10.8 ± 1.5 


10.3 ± 1.7 


14.4 ± 3.7 


7 not fixcd b e 


2.8 ± 1.4 


9.3 ± 2.4 


7.0 ± 3.0 


7.7 ± 2.2 


9.9 ± 3.8 


7 not fixed, dN/dz widcned b d e 


4.8 ± 2.4 


11.74 ± 3.1 


10.0 ± 4.0 


10.8 ± 3.1 


13.46 ± 5.23 


Derived 6q 












0.75 < r < 14 ft- 1 Mpc" 1 


0.75 ± 0.41 


1.88 ± 0.22 


1.49 ± 0.23 


1.50 ± 0.23 


1.85 ± 0.53 


0.75 < r < 2 h- 1 Mpc" 1 


0.84 ± 0.35 


1.87 ± 0.21 


1.52 ± 0.22 


1.43 ± 0.23 


1.88 ± 0.51 


0.75 < r < 35 ft" 1 Mpc" 1 


0.78 ± 0.38 


1.81 ± 0.21 


1.54 ± 0.21 


1.42 ± 0.23 


1.89 ± 0.50 


0.75 < r < 56 h' 1 Mpc" 1 


0.82 ± 0.35 


1.80 ± 0.21 


1.54 ± 0.21 


1.41 ± 0.23 


1.90 ± 0.49 


0.75 < r < 89 h' 1 Mpc" 1 


0.83 ± 0.34 


1.80 ± 0.21 


1.54 ± 0.21 


1.41 ± 0.23 


1.91 ± 0.49 


diV/dz from spectra b c 


1.34 ± 0.56 


2.20 ± 0.26 


2.58 ± 0.35 


2.42 ± 0.39 


3.12 ± 0.80 


dTV/dz widened by Gaussian 15 d 


1.19 ± 0.49 


2.32 ± 0.27 


2.15 ± 0.29 


2.05 ± 0.33 


2.84 ± 0.73 


7 not fixcd b e 


0.65 ± 0.24 


1.92 ± 0.54 


1.40 ± 0.50 


1.64 ± 0.53 


2.34 ± 1.14 


7 not fixed, d7V/dz widcncd b d ° 


0.97 ± 0.36 


2.47 ± 0.69 


1.90 ± 0.70 


2.38 ± 0.76 


3.46 ± 1.68 



a Throughout this table, unless otherwise noted, the assumed slope is 7 = 1.98. 
b This fit uses the "0.75 < r < 89 h^ 1 Mpc" definition for the scale. 

c Assumes the redshift distribution from spectroscopic matches (to DR1QSO, DR2 or the 2QZ) in each z p ^ ot bin. 

d Assumes the redshift distribution from photometric rcdshifts widened by Gaussians (see Equation 15). The measured 
dispersions between spectroscopic and photometric redshifts are a = 0.221,0.114,0.154,0.118,0.287 for each listed bin, 
respectively, after applying a 2a clip to remove catastrophic photometric redshift estimates. 

c These fits assume no slope for 7, instead taking the best fit from the data. 



